Speech Recognition Using a Discriminative , Context - Independent , Segment - Based SpeechRecognizerJan
نویسندگان
چکیده
| In this paper, we describe important improvements that were recently introduced in our Discriminative Stochastic Segment Model (DSSM) speech recognizer. We propose a new presegmen-tation algorithm and we optimize the structure of the Multi-Layer Perceptron (MLP) that estimates the phone probabilities. Additionally, we describe a cascade MLP combination technique that relaxes the drawbacks of traditional stochastic segment models. The proposed improvements have resulted in a statistically signiicant increase of the speaker-independent continuous phone recognition performance on the TIMIT corpus.
منابع مشابه
Allophone-based acoustic modeling for Persian phoneme recognition
Phoneme recognition is one of the fundamental phases of automatic speech recognition. Coarticulation which refers to the integration of sounds, is one of the important obstacles in phoneme recognition. In other words, each phone is influenced and changed by the characteristics of its neighbor phones, and coarticulation is responsible for most of these changes. The idea of modeling the effects o...
متن کاملAn Information-Theoretic Discussion of Convolutional Bottleneck Features for Robust Speech Recognition
Convolutional Neural Networks (CNNs) have been shown their performance in speech recognition systems for extracting features, and also acoustic modeling. In addition, CNNs have been used for robust speech recognition and competitive results have been reported. Convolutive Bottleneck Network (CBN) is a kind of CNNs which has a bottleneck layer among its fully connected layers. The bottleneck fea...
متن کاملContext-Dependent Modeling in a Segment-Based Speech Recognition System
The goal of this thesis is to explore various strategies for incorporating contextual information into a segment-based speech recognition system, while maintaining computational costs at a level acceptable for implementation in a real-time system. The latter is achieved by using context-independent models in the search, while contextdependent models are reserved for re-scoring the hypotheses pr...
متن کاملContext - Dependent Modeling in a Segment - BasedSpeech Recognition
The goal of this thesis is to explore various strategies for incorporating contextual information into a segment-based speech recognition system, while maintaining computational costs at a level acceptable for implementation in a real-time system. The latter is achieved by using context-independent models in the search, while context-dependent models are reserved for re-scoring the hypotheses p...
متن کامل